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5 Zoea Specification Language 

Zoea Specification Language (ZSL) is a text-based notation used to 
describe a program as a set of test cases. A ZSL program consists of one 
or more terms where each term has the form <Tag> : <Value>. Terms must be 
separated by white space but layout is not significant. ZSL supports the 
following set of tags: program, use, data, case, step, input, derive, and 
output. The mandatory program tag defines the program name. The optional 
use tag can identify one or more existing programs to include. An 
optional data tag allows static reference data to be embedded within a 
program. 

One or more test cases are introduced by the case tag, which is optional 
for a single case. Each case can have any number of optionally identified 
steps where a step contains a single input, derive or output tag. Data, 
input, derive and output values are basically JSON data although quotes 
are optional for strings that contain no special characters. Listing 1 
provides a simple example of a ZSL program. 


program: is day of week 
data: [monday,tuesday,wednesday, thursday, friday, saturday, sunday] 
case: 1 input: thursday output: day_of week 
case: 2 input: ‘MONDAY’ output: day _of week 
case: 3 input: banana output: not day of week 


Listing 1. Example ZSL Program 


6 Zoea Visual 

Zoea Visual is a visual programming language for describing Zoea test 
cases that is built on top of ZSL (see Fig. 1). In Zoea Visual all data 
is represented visually as input fields, tables, etc. Elements are placed 
in named columns that correspond to ZSL tag names. Zoea Visual supports 
any number of input and output values. 

Zoea Visual introduces the concept of dependencies, which are data flow 
links between pairs of elements in different columns. These are used to 
indicate that one or more source elements are involved in some way in the 
production of a target element. This allows the user to construct a data 
flow diagram for each test case that provides more detail about the user 
intention to the compiler. 
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Fig. 1. Example Zoea Visual Program 


Zoea Visual also introduces a further form of composition called 
subsidiary test cases. A subsidiary test case allows the code for a given 


element to be elaborated as a completely separate Zoea Visual program. 
This facility allows complex logic and multiple code paths to be 
expressed using a linear rather than exponential number of test cases. 


7 Zoea Architecture 

Zoea uses a version of the blackboard architecture in which many 
knowledge sources update and react to changes in a central data store. 
There are a number of differences between the Zoea blackboard and the 
classical model: 
e Knowledge sources are distributed across any number of physical 
computers; 
e Knowledge sources work on local copies of relevant subsets of the 
blackboard data that are synchronised in both directions through deltas; 
e The central copy of the blackboard data is mainly used for data 
integration and redistribution; 
e Global scheduling involves the allocation of time and resource budgets; 
e Activated knowledge sources act concurrently within their allocated 
budgets; 
e Control knowledge that relates to a specific knowledge source is part 
of the 
nowledge source rather than an external scheduler. 

The knowledge sources are built on top of a distributed computing 
framework that allows arbitrary compute jobs to be unicast or multicast 
to a pool of workers located on any number of physical nodes. A central 
controller monitors the activity of all workers, and distributes jobs 
including knowledge source activations and blackboard updates as 
required. Modern engineering provides high performance and scalability. 
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8 Knowledge Representation 

In Zoea test cases are used as the focal point for all problem solving. 
At the highest level the test cases that are provided by the user express 
the problem of determining the code that will transform each test case 
input into the corresponding output. 
Between the test cases and solutions the blackboard abstraction levels 
reflect the general stages of the problem solving process which are: 

e Identify features in test cases; 

e Apply test cases to knowledge sources; 
e Create new values from existing values; 
e Find code fragments for a value; 
e Find partial solutions for a single case; 

e Find complete solutions for a single case; 

e Find a set of case solutions that work for all cases; 

e Generate the raw code for a solution set; 

e Generalise the raw solution set code. 

The entries on the Zoea blackboard do not form a simple linear hierarchy. 
Rather, the creation of synthetic test cases and the knowledge source 
activations form a large and multiply recursive tree in which the code 
elements of any solution are both sparsely distributed and also mutated 


as a result of synthetic test case transformations. 


9 Discussion and Future Work 
ZSL and Zoea Visual are novel, highly declarative and simple programming 
languages. CIP also represents a completely new softwar development 


paradigm. 

A set of ZSL and Zoea Visual code examples has been released [4], and the 
complete ZSL language specification is also available in the public 
domain [5]. ZSL code is similar in terms of size but approximately half 
the complexity of equivalent code in a conventional language. In addition 
ZSL has only 25-30% of the language complexity of the simplest 


conventional languages such as Go or Python. 


Both ZSL and Zoea Visual have been used internally to create programs 
that are the equivalent to hundreds of lines of code in conventional 
languages. Automated testing with randomised programs has produced 
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hundreds of thousands of programs - many of which are equivalent to over 
thousand lines of conventional code. 

he construction of Zoea involved no element of training nor the use of 
y third party source code. It therefore avoids any imputations of 
ntellectual property misuse. 
hile the compiler is functional work continues to improve performance 
and to provide more end user tooling. In particular we want to make 
hypotheses visible to the user, allowing them to be optionally accepted 
or rejected. 
The synthetic test case transformations carried out by the knowledge 
sources could be defined in more abstract terms. This might be useful for 
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reasoning about interaction between knowledge sources, for example to 
identify any redundant activations. 
Currently Zoea is run on a dedicated cluster of physical servers. While 
this is suitable for development and testing it limits access for other 
purposes. At some stage we would like to make Zoea more publicly 
available and this will require migration to some form of cloud-based 


infrastructure. 
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10 Conclusions 

Zoea is the first inductive programming system that was designed to 
support the creation of code of arbitrary size. It is a knowledge-based 
system that integrates many diverse knowledge sources using a blackboard 
architecture. Knowledge source activations are driven by pattern 
recognition and test cases ar used as a ubiquitous basis for all 
reasoning. In giving rise to simple, novel programming languages and 
processes, Zoea also demonstrates the potential for AI to revolutionise 
software development. 
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